Building Resilient Systems on AWS: Learn how to design and implement a resilient, highly available, fault-tolerant infrastructure on AWS.

Compress Your XHTML Page Output Using PHP Output Buffers

By David Walsh on November 27, 2007

Us programmers love to keep our code readable and neat, which is why we use hundreds of tabs and line feeds in our code. The problem with doing this in the web programming world is that our users don't care about our neatly indented code. They don't care about our detailed code comments. They certainly don't care about the two-vs-three-vs-four spaces in a tab debate. So why should we make them download the extra code? With some quick PHP code, you don't need to.

The PHP Code

The entire system works on some really short PHP code:

<?php
	/*  start the output buffer  */
	ob_start('compress_page');

	/*  xhtml code below  */
?>
<!-- all xhtml content here -->
<?php
	/*  end the buffer, echo the page content  */
	ob_end_flush();

	/*  function that gets rid of tabs, line breaks, and extra spaces  */
	function compress_page($buffer) {
		$search = array('/>[^S ]+/s', '/[^S ]+</s', '/(s)+/s');
		$replace = array('>', '<', '1');
		return preg_replace($search, $replace, $buffer);
	}
?>

That's it. Compressing your page is easy to implement, especially if you use an MVC system for your website.

Recent Features

By David WalshDecember 6, 2012
Vibration API
Many of the new APIs provided to us by browser vendors are more targeted toward the mobile user than the desktop user. One of those simple APIs the Vibration API. The Vibration API allows developers to direct the device, using JavaScript, to vibrate in...
By David WalshJuly 1, 2013
9 Mind-Blowing Canvas Demos
The <canvas> element has been a revelation for the visual experts among our ranks. Canvas provides the means for incredible and efficient animations with the added bonus of no Flash; these developers can flash their awesome JavaScript skills instead. Here are nine unbelievable canvas demos that...

Incredible Demos

By David WalshAugust 22, 2010
Dijit’s TabContainer Layout: Easy Tabbed Content
One of Dojo's major advantages over other JavaScript toolkits is its Dijit library. Dijit is a UI framework comprised of JavaScript widget classes, CSS files, and HTML templates. One very useful layout class is the TabContainer. TabContainer allows you to quickly create a tabbed content...
By David WalshDecember 7, 2009
Create a Sexy Persistent Header with Opacity Using MooTools or jQuery
I've been working with the Magento eCommerce solution a lot lately and I've taken a liking to a technique they use with the top bar within their administrative control panel. When the user scrolls below a specified threshold, the top bar becomes attached to the...

Discussion

Mark Sanborn
I might be interested in something like this. I wonder however, about the effects it has on load time and cpu cycles on the server. Do you know if larger sites are using this?

Mark
David Duymelinck
I used the code to add a compression option to the CodeIgniter framework but i noticed your regular expressions removed begin tags and content. So i adjusted it;
```
$search = array('/>[\n\r ]+/s','/[\t ]+/s');
$replace = array('>','
```
david
@Mark: The only “larger” site I know of that crunches their HTML code is a small upstart called “Google” ;). If you view their search result source, you’ll see that most the page is on one line.

My thought has always been that if you need another server to handle this, get one (of course, only if it’s feasible). User’s shouldn’t have to carry this burden.

@David: I’ll take a look at what you’ve presented later today.
Mark Sanborn
Fair enough :) I will be testing/adding this to my sites soon.
Eric Wendelin
This is really, REALLY something a lot more sites should be doing. It’s so simple and effective.
Louis
This study points out how useless it is to compress HTML on common websites/weblogs.

I hope it helps.
Shaun
I also noticed the regular expressions removed begin tags and content.
I tried @david’s suggestion and and that didn’t work either, this did:
```
$search = array("/>[[:space:]]+/", "/[[:space:]]+</");
$replace = array(">","<");
```
Shaun
Nouman Saleem
@ louis – that study was done using html tidy, all that did was remove empty lines – not place it on a single line.
Louis
@Nouman: I don’t understand your remark. Your PHP code “gets rid of tabs, line breaks, and extra spaces”, and as I initially pointed out, that is not a relevant thing to do to optimize loading times and co.
Nouman Saleem
@ loius as I said that was done using html tidy which doesnt do what this script does, unless Im mistaken. This script makes it like the google source code where the files is compressed way further than what html tidy can do
Louis
@Nouman: what is “way further” exactly? If it’s just the removal of the newlines, I don’t believe that it will be enough to say that the same conclusion as the study I pointed to doesn’t apply here.

The removal of human-dedicated characters is not interesting when digging out to optimize performances because (a) there’s too little to win for normal size webpages, and (b) the gzip compression make advantage of the redundance of these special characters and thus compress the page effisciently enough.
Nolochemical
Neat! Very effective, perfect for inline documentation.
Davin Bergstrom
@Shaun: same problem, you solution worked.

Shaun

I rewrote the algorithm as My solution was incomplete (remove comments, tabs, spaces, newlines, etc):

compress_page($buffer) {
$search = array(
    "/ +/" => " ",
    "/<!--\{(.*?)\}-->|<!--(.*?)-->|[\t\r\n]|<!--|-->|\/\/ <!--|\/\/ -->|<!\[CDATA\[|\/\/ \]\]>|\]\]>|\/\/\]\]>|\/\/<!\[CDATA\[/" => ""
);
$buffer = preg_replace(array_keys($search), array_values($search), $buffer);
return $buffer;
}

this compresses all html, and javaScript but not CSS for this I use:

compress_styles($buffer) {
// remove comments, tabs, spaces, newlines, etc.
$search = array(
    "/\/\*(.*?)\*\/|[\t\r\n]/s" => "",
    "/ +\{ +|\{ +| +\{/" => "{",
    "/ +\} +|\} +| +\}/" => "}",
    "/ +: +|: +| +:/" => ":",
    "/ +; +|; +| +;/" => ";",
    "/ +, +|, +| +,/" => ","
);
$buffer = preg_replace(array_keys($search), array_values($search), $buffer);
return $buffer;
}

Hope this is useful to someone.

Gia ca phe
tks
It’s very helpful for me

Sair Ecashbot
Really cool feature
Uzo
Oh this piece of code has been godsend to me. Extremely easy to set up and very noticable rewards even on an empty cache. Thanks!
Jody
[code]
ob_start();
// INIT AND START THE APPLICATION
include_once(“includes” . DS . “init.inc.php”);
$sContent = ob_get_contents();
ob_clean();

$content = str_replace(array(“\n”, “\r”, “\t”, “\o”, “\xOB”), ”, $sContent);
[/code]

xo so minh ngoc

compress_styles($buffer) {
// remove comments, tabs, spaces, newlines, etc.
$search = array(
"/\/\*(.*?)\*\/|[\t\r\n]/s” => “”,
“/ +\{ +|\{ +| +\{/” => “{“,
“/ +\} +|\} +| +\}/” => “}”,
“/ +: +|: +| +:/” => “:”,
“/ +; +|; +| +;/” => “;”,
“/ +, +|, +| +,/” => “,”

Johny

Thank you! Could you please give me advice! Im using:

compress_page($buffer) {
$search = array(
    "/ +/" => " ",
    "/||[\t\r\n]||\/\/ ||\]\]>|\/\/\]\]>|\/\/ ""
);
$buffer = preg_replace(array_keys($search), array_values($search), $buffer);
return $buffer;
}

It works well, but my text (in html) has no space after commas!

Hello, how are you? - Hello,how are you?

Compress Your XHTML Page Output Using PHP Output Buffers

The PHP Code

Recent Features

Vibration API

9 Mind-Blowing Canvas Demos

Incredible Demos

Dijit’s TabContainer Layout: Easy Tabbed Content

Create a Sexy Persistent Header with Opacity Using MooTools or jQuery

Discussion